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We develop a hybrid type of quantum anneahng in which we control temper- 
ature and quantum field simultaneously. We study the efficiency of proposed 
quantum annealing and find a good schedule of changing thermal fluctuation 
and quantum fluctuation. In this paper, we focus on clustering problems which 
are important topics in information science and engineering. We obtain the bet- 
ter solution of the clustering problem than the standard simulated annealing 
by proposed quantum annealing. 
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1. Introduction 

Optimization problems have spread in wide area of science, for example, 
information science and statistical physics. Properties of optimization prob- 
lems can be summarized as follows: 



• There arc many elements. 
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• A cost function can be defined. The best solution is the state where 
the cost function takes the maximum value or the minimum value 
depending on the definition of the given problem. 

An well-known example of optimization problem is the so-called "traveling 
salesman problem" . The traveling salesman problem is to find the shortest 

path covering all the cities. Here, we can path through each city only once. 
In this problem, the cost function is the length of path. In general, it is 
difficult to obtain the best solution by naive approach since the number of 
candidates of a solution is very huge. Then, in the optimization problem, it 
is a central issue how to find the best (or better) solution. There are many 
types of optimization problems. Depending on an optimization problem, 
there are specific algorithms to solve the problem. 

To treat optimization problem in a general way, on the other hand, a 
couple of approaches have been developed from a viewpoint of statistical 
physics. The cost function can be regarded as the internal energy in sta- 
tistical physics. The cost function of the best solution corresponds to the 
internal energy of the ground state. Kirkpatrick et al. proposed a pioneer- 
ing generic algorithm - simulated annealing. ^'■^ Simulated annealing is a 
method to obtain not so bad solution of optimization problem by decreas- 
ing temperature. In high temperature, since the probability distribution 
at equilibrium state is almost flat, the state can be changed easily. As we 
decrease temperature gradually, generated probability distribution expects 
to approach the equilibrium probability distribution at each temperature. 
Then we can find not so bad solution of problem by simulated anneal- 
ing. Because of the Geman-Geman's argument,^ we can succeed to obtain 
the best solution with the probability unity if we decrease the tempera- 
ture slow enough. Although there could be better algorithms specific to 
each optimization problem, simulated annealing is regarded as a stable and 
generic method because of easy implementation and independency from 
problems. The purpose of our study is to establish a good general method 
for optimization problems from a viewpoint of quantum statistical physics. 

The organization of this paper is as follows. We first review on quantum 
annealing which has been believed a good general method to obtain not so 
bad solution of optimization problems. In section 3, we will review how to 
implement Monte Carlo simulation. In section 4, we will introduce a model 
which can treat clustering problems. We also consider quantum effect on 
this model in this section. In section 5, some remarks on this problem will be 
shown. In section 6, we will show results obtained by our proposed method. 
In section 7, we will summarize our study. We will review on the concepts 
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of "invisible fluctuation" in the appendix A. 
2. Quantum annealing 

Quantum annealing is expected that it succeeds to obtain the better solu- 
tion of optimization problems than simulated annealing. This method is 

based on quantum statistical physics. There have been a couple of realiza- 
tion methods of quantum annealing: (i) stochastic method, (ii) deterministic 
method, and (iii) experiment on artiflcial lattice. 

The stochastic method is realized by quantum Monte Carlo method in 
many circumstances. "^'^^ The quantum Monte Carlo method is an estab- 
lished method to obtain equilibrium properties of strongly correlated quan- 
tum systems. Efficient algorithms for quantum annealing have been devel- 
oped such as the cluster algorithm. ^"^^^^ Owing to these masterly methods, 
quantum Monte Carlo simulation can be adopted for large-scale systems. 

There are a couple of deterministic methods for quantum annealing. 
The first one is based on the time-dopendcnt Schrodingcr equation.^ This 
method can trace the real-time evolution which can be observed in real ex- 
periments. This method is called as "quantum adiabatic evolution" in quan- 
tum information science.^ This method cannot treat large-scale systems 
because of the limitation of size of memory in computer. The second one is 
based on a time-dependent density matrix renormalization group. ^^'^^ By 
this method, we can study time-evolution of one-dimensional quantum sys- 
tems. However it is difficult to treat two-dimensional or three-dimensional 
quantum systems by this method. Both of two methods are based on the 
principle of quantum mechanics. However, it is not necessary to treat the 
Schrodinger equation directly if we adopt the deterministic method as quan- 
tum annealing. This is because our purpose is to obtain not so bad solution 
of given problem. One of the examples is mean-field calculation.^^'^^ This 
method can treat large-scale systems as well as the quantum Monte Carlo 
simulation. Then, this method has been widely adopted for optimization 
problems. 

Next there are a couple of proposals for experiments on artificial lattice 
for quantum annealing. For example, we can generate quantum state by 
optical lattice. ""^^"^^ In addition, the Ising model with transverse field can be 
realized by superconducting flux qubits.^^"^^ These methods are expected 
as a new type of quantum computer. 

There have been a number of studies on quantum annealing from a view- 
point of theoretical physics. Convergence theorem for quantum annealing 
was proved as well as that for simulated annealing. ■^''"^^ According to this 
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theorem, the permitted upper bound of sweeping speed of quantum field in 
quantum annealing is larger than that of temperature in simulated anneal- 
ing. Prom this theorem, quantum annealing seems better than simulated 
annealing in principle. However, we often sweep temperature and/or quan- 
tum field faster than that upper bound in practice. Then, it is nontrivial 
whether quantum annealing is better than simulated annealing from a view- 
point of practical situation. Microscopic behavior in quantum annealing has 
been also investigated. It is well-known that frustrated systems have some 
interesting static behavior induced by thermal fluctuation and quantum 
fluctuation. Dynamical properties of frustrated systems have been studied 
from a viewpoint of quantum annealing. ^""^^ Novel type of implementation 
of quantum annealing itself is an important topic. One of the examples 
is quantum annealing based on the Jarzynski equality.'^^ This method is 
expected an efficient method since it uses both merits which come from 
thermal fliictuation and qiiantiim fluctuation. 

In this paper, we adopt the first strategy - the quantum Monte Carlo 
method as the realization of quantum annealing. 



3. Monte Carlo simulation 

In this section, we review how to implement Monte Carlo simulation. It has 

been often used in order to obtain the equilibrium properties of strongly cor- 
related systems such as magnetic systems and bosonic systems. Equilibrium 
physical quantities of the system which is expressed by the Hamiltonian H 
at finite temperature T is given as 

where /? denotes the inverse temperature 1/T and here the Boltzmann 
constant fce is set to be unity. If we consider a small system, we can obtain 
all of the equilibrium physical quantities by naive method. If we consider 
large scale systems, however, wc cannot obtain the equilibrium properties 
by naive method in practice. Monte Carlo simulation enables us to calculate 
equilibrium physical quantities with high accuracy by the following relation. 

where E denotes sample, in other words, state. Physical quantity converges 

to the equilibrium value as the number of samples increases. In fact, the 
above calculation is inefficient if states are generated by uniform distribu- 
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tion. In order to make the method more efficient, we generate states ac- 
cording to equihbrium probabihty distribution which is proportional to the 
Boltzmann factor e"''^^^). This method is called the importance sampling. 
We can obtain the equilibrium value as follows: 

^^^{0)gl (3) 

In order to generate a state from the equilibrium distribution, we just have 
to use Markov chain Monte Carlo method. Time evolution of probability 
distribution is given as the master equation: 

P{-Si,t + At)= -^P(Si,t)u;(S,|Si)Ai 

+ P{Ej,t)w{Ei\Ej)At + P{i:i, i)«;(Si|Si)Ai, (4) 

where P(Ej,t) denotes probability of the state Sj at time t and 
represents transition probability from the state S, to the state Ej in unit 
time. Transition probability w(Sj|Sj) obeys 

^^S,|E0 = 1 (VS,). (5) 

The master equation can be represented as 

P(t + At) (6) 

where P{t) is a vector-representation of probability distribution {P(Ej,t)} 
and C is called the transition matrix whose elements are expressed as 

Cji = w{Y;j\Ei)At, £ii = l-^/:j, = l-^w(I]j|Si)At. (7) 

It should be noted that £ is a non-negative matrix by the definition. This 
time evolution is the Markovian since the time-evolution operator C does 
not depend on time. If the time-evolution operator C obeys (i) detailed 
balance condition and (ii) ergordicity, we can obtain the equilibrium prob- 
ability distribution in the limit oi t —>■ oo because of the Perron- Frobenius 
theorem. 



4. Model 

Clustering problem is one of the important problems in information sci- 
ence and engineering. Since it is difficult to obtain the best solution of the 
clustering problem by naive method, development of a new method which 
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can obtain the best (or not so bad) solution is an important issue. We can 
obtain not so bad solution by using simulated annealing as we mentioned 
in the section 1. We propose a new type of quantum annealing and succeed 
to obtain better solution by proposed quantum annealing method as will 

be mentioned. 

In the beginning of this section, we will explain a model to consider 
clustering problems. After that, we will introduce new kind of fluctuation - 
quantum fluctuation - into this model. Next, we will review on implemen- 
tation method of the quantum annealing method. 

4.1. Clustering problem 

In clustering problems, there are N elements in the space, which is depicted 
in Fig. 1(a). 
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Fig. 1. The dots represent elements. The dotted boxes denote clusters. In these figures, 
the number of elements N = 69 and the number of clusters Q = 4. (a) There are A'^ 
elements in the space, (b) Not so good solution, (c) Not so bad solution, (d) The best 
solution. 



Clustering problems are to decide which the best partition of these ele- 
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ments. In other words, clustering problems are to seareh the best division of 
elements into Q sub-categories. Clustering problems have been applied 
for not only natural science but also social science. For instance, divisions of 
the articles in newspaper in terms of contents and analysis of questionnaire 
can be considered as clustering problems. Figure 1 (b), (c), and (d) repre- 
sent not so good, not so bad, and the best solution, respectively. In practice, 
it is difficult to obtain the best solution by direct method for huge number 
N, since the number of total states is . Then we often use simulated 
annealing to solve clustering problems as one of useful generic methods. 
However, even if we use simulated annealing, it is difficult to obtain the 
best solution. The; encirgy landscape of clustering problems is very compli- 
cated such as random spin systems and frustrated systems. Then we should 
develop more efficient algorithm than the simulated annealing method. Ac- 
tually, there is a pioneering method to obtain more better solution than 
simulated annealing method. This method is called exchange method. '^^ 
In the exchange method, we prepare some independent layers where the 
temperatures are different. We sometimes exchange the states between two 
layers according to the Boltzmann weight. In this paper, we adopt another 
strategy, quantum annealing method as more efficient algorithm. We intro- 
duce quantum term into this model to implement the quantum annealing 
in the next section. 



4.2. Quantum fluctuation 

Before introducing a quantum fluctuation, we review on a classical Hamil- 
tonian of the original clustering problem. The classical Hamiltonian is given 



by 



7^c = diag (£;(Si),£;(S2),--- 



(8) 



where -E(Si) denotes the eigenencrgy of i-th state S^. Suppose wc consider 
the case for Q = 3 and A?^ = 2 as an example. The classical Hamiltonian is 
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given as 
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(9) 



where the definitions of the states from Si to Sg are summarized in Table 
1. (Ti and (72 in Table 1 represent the states of the first element and the 
second element, respectively. 



Table 1. Definitions of the states from Si to Sg. 









S3 S4 


S5 


Se S7 Ss S9 




1 


2 


3 1 


2 


3 12 3 




i 


1 


1 2 


2 


2 '■'> ^5 .'i 



Next we introduce a quantum fluctuation into this model. Since when 
(5 = 2, the classical Hamiltonian is equivalent to that of the Ising model. It 
is natural to extend transverse field in the Ising spin system as a quantum 
fiuctuation. Then we adopt the following definition as a quantum fluctua- 
tion. 



AT 



N 



(10) 



i=l 



where Ig denotes the matrix whose all elements are unity and Eg represents 
identity matrix. Both Ig and Eg are Q x Q matrices. It should be noted 
that af denotes the x-component of s = 1/2 Pauli matrix at the site i when 
Q = 2. Suppose we consider the case for Q = 3 and = 2 as the previous 
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example, the quantum part of the Hamiltonian Hq is given as 



(11) 



/ -r -r -r o -r o o \ 
-r -r -r o -r o 
-r -r -r o -r 
-r -r -r -r o 
nq= -r -r -r -r 
-r -r -r o o -r 
-r -r o -r -r 
-r -r -r -r 
V -r -r -r -r / 

When wc use riuantum Monte Carlo method, all we have to do is to calculate 
the probability of the state S. In the next section, wc will show how to 
calculate the probability of the state S by path-integral representation.^^'^^ 



4.3. Path integral representation 

We consider the following Hamiltonian 

n = nc + Hq. (12) 

When the Hamiltonian H is a diagonal matrix which corresponds to simu- 
lated annealing, i.e. F = 0, the probability of the state S at finite temper- 
ature T is given as 

p-/3£;(E) 1 

Psa(S;/3) = T^^^zm: = ^(S|e-''«1S), (13) 

where the denominator is called the partition function in statistical physics. 
The partition function Z is calculated as 

Z = Tve-^^^ = ^(S|e-'^«"|S) = ^e-'^^(^). (14) 

E E 

We can change the state by using the "single-spin-fiip" type of heat bath 
method, 

pZ^'^'^cTi = s|S\a,) = '^'^'"'^ , (15) 

where T,\ai means {<Tj\j ^ i} and p{A\B) denotes a conditional probability 
of A given B. The denominator of Eq. (15) can be calculated where the 
computational cost is 0{Q). 
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In similar with the classical case, the probability of the state S at finite 
temperature T and finite quantum field F is given as 

Pqa(S, /3, r) - ^-^^^^-^^ . (16) 

Note that it is difficult to calculate (S|e~'^^|S), since the Hamiltonian in- 
cluding quantum field has off-diagonal elements. For small systems, we can 
exactly calculate all of the elements e~^'^ by using the unitary transform. 
However, we can not obtain them for large system in practice. In order to 
calculate the probability given by Eq. (16), we should rewrite the numerator 
of it by path-integral representation. Then we obtain the probability 

PQA(S;/3,r) = 1(E| (e-^«=e-^^<>)'"|I]) + 0(l) 

S(l)' S(m)' j = l 

where m is called as the Trotter number and S*^^^ = = S which 

corresponds to periodic boundary condition along the Trotter axis. 
Here we define 

a(E«,E(^+i)):=l5:^(a«,ar^)), (18) 

i=l 

f{(3, T) := N log (^1 + J , (19) 

where o-p^ denotes the i-th element on the j-th Trotter layer. Then we can 
obtain 

(S«)|e--'^-|EW) oc PSA (^S^^'^; ^ S (s(^\e(^)') , (20) 
(S^'le-^^-'lS^^+i)) oc e«(^"''.^"+^')/('5'r), (21) 
since He is a diagonal matrix and the equation 

{aff = (iQ - EqY = ^ [(1 - QY - 1] Iq + (-I)'Eq (22) 

is satisfied. Then we obtain 

1 "* « 

PQA(S;/3,r) = ^E--- E nfSA(SW;^)e«(^"''^""^^)/(^>^). (23) 

E(2) EC*") j=l ^ 
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Trotter axis 



PQA(S;Ar): 



Trotter axis 



Trotter axis 





Ed) = S 

Fig. 2. Conceptual diagram of path integral. In order to calculate the probability of 
the state E of quantum system, we add an extra dimension which is called the Trotter 
axis. The probability pQA(E;/3,r) can be calculated by talking sum of configurations of 
path depicted by the bold line. 



By using path-integral representation, the probability of the state S of 
the (i-dimcnsional quantum system can be represented by that of {d+ 1)- 
dimensional classical system approximately (see Fig. 2). 

Here it should be noted that the function /(/3,r) is a monotonic de- 
creasing function of inverse temperature /3 and quantum field F (see Fig.3). 
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Fig. 3. F) as a function of F for AT = 5000, Q = 30, and m = 50. 
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In the quantum Monte Carlo simulation by using path-integral repre- 
sentation, we update the state according to the probability as follows: 

fQ-k'-STl^F^ = a|S«\a«,E0-i),E0+i);/3,r) = (24) 

E?=i^(s ) 

P{s) = e{-|:J5(si'■')+Ks""'^si'^)+«(s?■^s''■+'^)l/(/3.^)}^ (25) 
where si''^ represents the state crp' = s given Y.^^\a'f \ 



5. Some remarks 

In the section 4, we introduced the quantum fluctuation and obtained the 
probability of the quantum state by the path-integral representation. Some 
remarks will be shown in this section. 



5.1. Labels of the clusters 

The value s{'S^^\'E^^~^^^) given by Eq. (18) expresses the correlation func- 
tion between the state on the j-th Trotter layer and that on the j + 1-th 

Trotter layer. When the divisions are the same but the labels of the clusters 
are completely different as shown in Fig. 4, s(E(-'\ S(-'+^)) becomes zero. 
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(a) (b) 

Fig. 4. Both (a) and (b) are the same divisions but the name of the clusters are com- 
pletely different. 



When we decrease the quantum fleld T slow enough, not only division 
but also the labels of the clusters should become the same. If the state on 
the i-th layer is S^^) = Ea and that on the j + 1-th layer is = Eb 

such as Fig. 4 by accident, we cannot only gain a benefit but also are 
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faced with a problem by introducing path-integral representation. This is 
similar situation with the domain wall problem in the ferromagnetic Ising 
model. In the ferromagnetic Ising model without external magnetic field, 
the ground state is that all spins are up or down. Suppose we consider the 
case that the spins in the left half arc up and the spins in the right half are 
down. If we execute the standard Monte Carlo simulation, it is difficult to 
obtain the stable state {i.e. the ground state) from such an initial state for 
a short time. For clustering problems, the domain wall problem gets more 
seriously comparing with the standard ferromagnetic Ising model since the 
number of the same divisions is Q\. In order to avoid the domain wall 
problem, we introduce a new parameter "modified correlation function" 
and approximate the probability distribution. The definition of the modified 
correlation function s(S(-'\ S'''+^)) is 

5(SO-),SO-+i)) := ^^maxe'=i,..,,[y(S«)F^(E(^+i))]e,c', (26) 

c=l 

r(EW):=(a(^\a(-'\...,a«), (27) 

where y(I](-')) denotes N xQ matrix. For Q = 3 and TV = 2, ^(Ss), where 
S3 was defined in Table 1, is given as 

/01\ 

Yi^s) ^^^^ 

It should be noted that the correlation function s(T,^^\T,^^~^^^) can be ex- 
pressed in a similar way such as 

s(5]0-),E(j'+i)) = i^Tr[y(S(j'))r^(E(j'+i))]. (29) 

Let us show sonic properties of the modified correlation function com- 
paring s(S(^\ S(^+^') in the next section. Suppose we consider the case for 
Q = 3 and N = 7 in the section 5.1.1 and 5.1.2. 

5.1.1. Example A 

We consider a case that the states on the j-th layer and that on the j + 1-th 
layer are shown in Table 2. 

Here, y(S(j)) and F(S(J+i)) are given as 

/I 1 1 0\ /O 1 1 0\ 

y(s(-')) =0001100, y(E(-'+i)) =0010001. (30) 
Vo 000011/ Viioiooo/ 
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Table 2. Example A. 
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Then, the modified correlation function is calculated as 

3 

5(SO),eO+i)) = -^max,,=i,„,3 10 1 = -. (31) 

c=i V 1 1 0/ , 

^ ' c,c' 

Note that s(S(j), S^J+i)) = by the definition. Next we fix the labels of 
and rename the labels of E^^+i). The values of s(i;(j), "P^E^^+i)) are 
shown in Table 3, where V^^ (tt = I,-- - ,Q!) denotes label permutation 
operator. 



Table 3. The value of correlation function by applying label permutation 
operator V-jr for example A. 
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In this case, the maximum value of s(S(j), 7'^S(J+i)) is the same as the 
value of modified correlation function s(E*^-'\ E*^-'+^)). 

5.1.2. Example B 

We also consider another case that the states on the j-th layer and that on 
the j + 1-th layer are shown in Table 4 as an another example. 



Table I. Exaiiii)k* B. 
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Here, y(S(^)) and Y{T,^i+^^) are given as 

/1 o\ /o 1 1 1 1 r 

=0111100, y(s(^+i)) = 1 I . (32) 

VO 000011/ VO 00010 0, 

Then, the modified correlation function is calculated as 

= -5]maxe.=i,..,3 3 1 = ^. (33) 

Note that s(S(j\ S(J+i)) = by the definition. As in the section 5.1.1, 
when we fix the labels of S^-'^ and rename the labels of S(-'+^^, the values 
of s(I](^\P„S(J+i)) are shown in Table 5. 



Table 5. The value of correlation function by applying label permutation 
operator for example B. 
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3 


3 


1 


3 


3 


2/7 



In this case, the maximum value of s(EW),P^S(J+i)) is not the same 
as the modified correlation function s{T,^^\'E^^~^^'>). Although the modi- 
fied correlation function expresses some kinds of similarity between the 
states and it is not necessary commutative i.e. s(S^^\ ^ 

s(Z;W+i), EW)). ^j^g pj^gg Qf example A, 

= S(e(^'+i),e(J')) = ^. (34) 
On the other hand, in the case of example B, 

s(EW,E(^'+i)) = ^, s(E(^'+i),S«) = ^. (35) 
Here we summarize the properties of the modified correlation function: 
• The modified correlation function does not necessary commute. 

g(£a)^£a+i)) ^ s(i]0+i),s(^)). (36) 
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• For any (tt = 1, ■ • ■ , Q!), the inequality 

< s(i](^'\7'^s(J'+i)) = Tr [y(i](^'))y(KS^^'+^')^l < < 1 

is satisfied. 

Although the modified correlation function is not necessary the same as 
the maximum value of s(S(^),P^E;(J+i)), wc use this value in our proposed 
algorithm. This is because our aim is to find not so bad solution of given 
problem as mentioned before. 



5.2. Speed-up by Using Modified Correlation Function 

When we adopt the quantum Monte Carlo simulation by the path-integral 
representation, it is enough to update the state according to Eq. (24) in 
principle. In the previous section, wc introduced the new parameter "mod- 
ified correlation function" to avoid the so-called domain wall problem. In- 
stead of Eq. (24), we adopt the following the transition probability based 
on modified correlation function: 



update f (j) _ \UJ)\ ij) vO-i) y;0+i)-« D 



exp 




^-E{Y.^i^) + ^(s(^-i), sO'+i) )/(/?, r)' 




Et=i cxp 





5(S(^-i\ S(^+i)) := s(E(^-i\ I](^)) + S(I](^\ I](^+i)). 



(37) 
(38) 



Since §(!](•' S^-'^) is not necessary commutative, there are four possibil- 
ities of the definition of 5(S(^-i), S(^), as follows: 

S(S(^'-i),I]«,I](j'+i)) = s(E(^-i),E(j)) + S(I](j),E(J+i)), (39) 

5(S(^-i),eW,E(^'+i)) = s(E(^'-i),E(^)) + §(E(^+i),E(^)), (40) 

5(E(^-i\eW,E(^'+i)) = s(E(-''\e(^-i)) + s(E(^'\e(^+i)), (41) 

S(E(^-i\eW\e(^'+i)) = s(E(-''\e(^-i)) + §(E(^+^),E(^')). (42) 



For small systems we confirm that when we adopt Eq. (39) as the defini- 
tion ofS(E«-i),E(^\EO+i)), we can obtain better solution than the other 
choices. Then, we adopt the relation given by Eq. (39) as the definition of 

5(E(^-i),Ei^\E(j'+i)). 
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5.3. Simultaneous Control of Thermal Fluctuation and 

Quantum Fluctuation 

In our algorithm given by Eq. (37), thermal fluctuation /3/m and quantum 
fluctuation /(/3, F) coexist. Then, it is expected that we can obtain the bet- 
ter solution than simulated annealing by controlling the thermal fluctuation 
and quantum fluctuation simultaneously with ingenuity. 

In order to investigate how to control both of the thermal fluctuation and 
quantum fluctuation, we flrst consider limiting cases. When the temperature 
is very low than the quantum field, i.e. /3/m ^ /(/3,r), the probability 
distributions in each layer obey Eq. (13). On the other hand, if the quantum 
field is very weak, i.e. fi/m <C f{j3,T), the state in each layers becomes 
the same state such as E = E^-'' for all j. By calculating small systems, 
we find how to control the thermal fluctuation and quantum fluctuation 
simultaneously. The best condition is as follows: (i) At the flrst step, to find 
the metastable state on each layers, we set the temperature /3/m ^ f{(3, T). 
(ii)After that, /(/?, F) ovcrstrides thermal fluctuation. The curve /* in Fig. 5 
shows a conceptual diagram for our proposed schedule. 




r 



Fig. 5. The dotted curve denotes the schedule of inverse temperature, /i is too rapid 
for decreasing the quantum field. On the other hand, /2 is too slow for decreasing the 
quantum field. /* denotes the best schedule. 

The dotted curve in Fig. 5 indicates the schedule of cooling temperature 
which corresponds to the simulated annealing. The curve depicted /* in 
Fig. 5 is the best schedule for obtaining better solution than the simulated 
annealing. In the schedule depicted /i in Fig. 5, quantum mixing effect does 
not make sense. In the schedule depicted /2 in Fig. 5, on the other hand, 
the states on each layers behave independently. The quantum fluctuation 
effect is strong than the thermal fluctuation effect in the schedule depicted 
/2. It is essentially the same as the simulated annealing. 

Here we assume the scheduling functions of temperature and quantum 



January 19, 2013 21:47 



WSPC - Proceedings Trim Size: 9in x 6in ShuTanaka submitted'20110417 



18 

field as follows: 

m = A)4' (43) 

T{t) = oo, {t<T), r(i)=roexp(-4-^) {t>T), (44) 

where T corresponds to the time /3(t) = to. When QI3T /m ^ 1, the in- 
teraction along the Trotter axis T) given by Eq. (19) is approximately 
given as 

/(;3,r) ~ -iVlog(^) = Nrl - iVlog(^). (45) 

From this equation, it is enough to set large enough Fq and < rp in 
order to prepare the schedule /*. In other words, independent simulated 
annealing is performed until r and after that we decrease quantum field to 
obtain the better solution than the conventional simulated annealing. 



6. Results 

We perform numerical experiment for the following three problems by pro- 
posed quantum annealing and simulated annealing for comparison. We pre- 
pare the number of replicas m = 50 in the whole experiments. The initial 
inverse temperature and the initial quantum field arc set to be /3o = 0.2m 
and Fo = e^^^, respectively. Here the ratio of changing temperature is set 
to be Tfj = 1.05. In the simulated annealing, we prepare independent 55 
samples and use the same initial temperature and the ratio of changing 
temperature as the case of quantum annealing. We study three problems 
as follows: 

(a) Evaluation of mixture of Gaussian by using MNIST data^° 

(Q = 30,Ar = 5000) 

(b) Evaluation of latent Dirichlet allocation^^ by using Reuters data*^ 

(Q = 20,iV = 2000) 

(c) Evaluation of latent Dirichlet allocation by using NIPS corpus^"^ 

(Q = 2Q,N = 1000) 

In all of the experiments, we fix the ratio of changing the inverse tempera- 
ture r/3 = 1.05. 

Figure 6 denotes the time development of the minimum energy in all of 
the layers for each problems. The circles in Fig. 6 represent results obtained 
by simulated annealing for comparison. The triangles, asterisks, and squares 
denote obtained results by proposed quantum annealing for rr = 1.02, rr = 
1.10, and rr = 1.20, respectively. The schedule for rr = 1-02 corresponds 
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40 60 80 100 40 60 80 100 40 60 80 100 

iteration iteration iteration 



(a) (b) (c) 

Fig. 6. Time development of the minimum energy in all of the layers. The circles rep- 
resent results obtained by simulated annealing for comparison. The triangles, asterisks, 
and squares denote obtained results by proposed quantum annealing for rr = 1.02, 
rp = 1.10, and rp = 1.20, respectively, (a) Evaluation of mixture of Gaussian by using 
MNIST data (Q = 30, iV = 5000). (b) Evaluation of latent Dirichlet allocation by using 
Reuters data {Q = 20, Af = 2000). (c) Evaluation of latent Dirichlet allocation by using 
NIPS corpus (Q = 20, = 1000). 

to schedule depicted /2 in Fig. 5 whereas the schedules for rp = 1.10 and 
rp = 1.20 correspond to schedule depicted /* in Fig. 5. From Fig. 6 it is 
clear that we can obtain the better solution than simulated annealing if we 
adopt the schedule /*. 

7. Conclusion 

In this paper, we developed hybrid type of the quantum annealing for clus- 
tering problems. When we apply the standard quantum annealing for these 
problems, we face on the difficulty of the domain wall problem. Since the 
number of the same division is the domain wall problem gets more se- 
rious for large Q. To avoid such a problem, we introduced new parameter 
"modified correlation function" instead of the standard correlation func- 
tion. We investigated the best schedule of changing the thermal fluctuation 
and the quantum fluctuation simultaneously. We first apply strong quan- 
tum field and then we obtain the metastable states. At the second step, 
we can obtain the better solution than the simulated annealing by decreas- 
ing the quantum field. Actually, we succeeded to obtain the better solution 
for clustering problems than the simulated annealing. We expect that the 
proposed schedule of changing the thermal fluctuation and the quantum 
fluctuation is generally efficient for other type of problems. However, it is 
an open problem when to use the quantum annealing. To solve this prob- 
lem, we should study the efficiency of quantum annealing for the problems 
where the difficulty of the problem can be controlled. 
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Appendix A. Implementation Method of New Kind of Fluctua- 
tion 

In this paper, we studied a quantum effect on clustering problems which are 
expressed by the Potts model. Relationship between a phase transition and 
a performance of quantum annealing is very important, although we did not 
mention it in this paper. Many researchers have studied the order of the 
phase transition of a given problem and concluded whether the quantum 
annealing is efficient or not for the problems. ^'^"^'^^"^^ It is one of main 
topics on quantum annealing. 

Suppose there is a transition point at halfway of a control parameter 
such as magnetic field. If a first-order phase transition occurs, the quantum 
adiabatic computation, in principle, does not obtain a good solution because 
of a level-crossing problem. There is a trivial example of level-crossing prob- 
lem: the ferromagnetic Ising model with longitudinal magnetic field h^. In 
this model, level crossing occurs at = 0. The initial state is set to be a 
ground state of negative h^. When we sweep from negative to positive, 
the state cannot follow the adiabatic limit of the state at all. If a second- 
order phase transition occurs, on the other hand, the growth of correlation 
length does not follow for finite speed of sweeping of control parameter. 
The equilibrium value of correlation length diverges at the second-order 
phase transition point. In practice, the quantum annealing does not suc- 
ceed to obtain the best solution for systems in which the second-order phase 
transition occurs. If there is no phase transition, the quantum annealing is 
expected to find the best solution. 

Our purposes are to control the order of phase transition and, what 
is more, to erase the phase transition by adding some kind of fluctuation. 
Recently, we constructed a model in which the order of the phase transi- 
tion can be changed by controlling a fluctuation.'''^"'*^ This model is called 
the Potts model with invisible states. We found that a first-order phase 
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transition is driven by the effeet of invisible states (invisible fluctuation) 
in the ferromagnetic Potts model with invisible states. Although the in- 
visible fluctuation itself seems to be getting worse for quantum annealing 
from the above discussion, the invisible fluctuation changes the order of the 
phase transition without changing an essence of problems. We expect that 
there are fluctuations which wipe a phase transition. Thus, to introduce 
new kinds of fluctuation is important for optimization problems. In this 
section, we introduce the concept of invisible fluctuation. 

We flrst consider the standard Potts model. The Hamiltonian of this 
model is given by 

^standard = ^ ^ Jij^ai,aj^ CTj = 1) ' ' ' iQi (-^•-'-) 

{i,0)eE(G) 

where E{G) denotes the set of edges of given graph G. Eq. (A.l) is called 
the; Q-statc Potts model. Here we assume Q is a natural number. Suppose 
we consider the ferromagnetic case i.e. Jij = —J for all € E(G). 

A second-order phase transition occurs when Q < 4 whereas a first-order 
phase transition occurs when Q > 4 on two-dimensional lattice. It is inter- 
esting that the order of the phase transition of the standard ferromagnetic 
Potts model can be changed by the number of states Q. The ground state 
of this model is that all of the spins have the same value. The number of 
ground states is Q. Then, the phase transition accompanies spontaneous 
Q-fold symmetry breaking. The standard Potts model has been regarded 
as the standard model not only in statistical physics but also in wide area 
of science. 

Wc consider the Hamiltonian of the Potts model with invisible states as 
follows: 

Q 

"Winv = ^ JijSai,aj ^ Sai,a, ^-j = 1, • • • ,Q + R. (A.2) 

{i,j)eE{G) a=l 

This model is called the {Q ,R)-sta.te Potts model. "^^"^^ Suppose we consider 
the case for Jjj = — J for all {i,j) € E{G) for simplicity as the previous 
example. If and only if 1 < ctj = aj < Q, interaction — J works. Thus, the 
number of ground states is Q. Note that if i? = 0, this model is equivalent to 
the standard ferromagnetic Potts model. Hereafter we call the states where 
1 < < Q "colored states" whereas the states where Q+l<ai<Q + R 
"invisible states". 

Here we consider two spin system. The number of excited states of the 
standard ferromagnetic Potts model given by Eq. (A.l) is — Q. On the 
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other hand, the number of excited states of the (Q,i?)-state Potts model 
given by Eq. (A.2) is _ g + 2QR + . The (Q,i?)-state Potts model 
does not change the number of degeneracy of the ground states. However the 
number of excited states are different. As the number of sites TV increases, 
since density of state changes, it is expected that nature of phase transition 
changes. 

The order parameter of the ((5,i?)-state Potts model is defined as 



1 ^ 



i=l 

where e" (a = 1, • • • ,Q) represents Q unit vectors pointing in the Q sym- 
metric direction of a hypertetrahedron m. Q — 1 dimensions. It should be 
noted that the Kronecker's delta can be represented by using as follows: 

1 + (Q - l)e" • 

Oa,l3 = Q • (A.4j 

The definition of order parameter is the same as that of the standard fer- 
romagnetic Q-state Potts model. The phase transition accompanies Q-fold 
symmetry if a phase transition takes place in this model. We investigated 
this model by mean-field analysis and Monte Carlo simulation. ''^"^^ In these 
papers, we concluded that the invisible states drive the first-order phase 
transition and a phase transition with Q-fold symmetry breaking occurs at 
finite temperature. 

Before conchiding this section, we discuss why a first-order phase tran- 
sition is driven by the invisible fluctuation. The Hamiltonian given in 
Eq. (A.2) can be transformed exactly by comparing the partition function 
as follows: 

Q N 

^rfv= E ^■^n,r,1^5n,c«-riogrE(5..,o, (A.5) 

{i,j)eE{G) a=l i=l 

Ti = 0,l,--- ,Q, (A.6) 

where T represents a temperature. Here we rename the label of the invisible 

states from Q + 1 < (Ji < Q + R to Ti = 0. The second term means chem- 
ical potential of the invisible states. The (Q,i?)-state Potts model can be 
mapped onto the annealed diluted Potts model whose chemical potential 
depends on temperature linearly. As we change temperature, the chemi- 
cal potential is varied. This concept is similar with our hybrid quantum 
annealing method. It should be noted that temperature-dependency of the 
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chemical potential comes from the number of invisible states, in other words, 
entropy of the invisible states. 

As mentioned above, the invisible fluctuation itself is inefficient for opti- 
mization problem. However the quantum annealing expects to be a powerful 
method by adding a now fluctuation - quantiim fluctiiation. In a similar 
way, it is possible that there is a "good" fluctuation for optimization prob- 
lems. In this section, we have considered the effect of invisible fluctuation. 
The invisible fluctuation is one of "cntropic fluctuation" . The order of phase 
transition is decided by the density of states. Then, such a entropic fluctu- 
ation is expected to wipe a phase transition. We believe that an entropic 
fluctuation which is constructed as the invisible fluctuation makes some 
advantages for optimization problems. 
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